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Abstract 

This paper continues a series of studies devoted to analysis of the bi- 
variate probability distribution V(x, y) of two consecutive price increments x 
(push) and y (response) at intraday timescales for a group of stocks. Besides 
the asymmetry properties of V(x, y) such as Market Mill dependence pat- 
terns described in preceding paper PQ| , there are quite a few other interesting 
geometrical properties of this distribution discussed in the present paper, 
e.g. transformation of the shape of equiprobability lines upon growing dis- 
tance from the origin of xy plane and approximate invariance of V(x, y) with 
respect to rotations at the multiples of n/2 around the origin of xy plane. 
The conditional probability distribution of response V(y\x) is found to be 
markedly non-gaussian at small magnitude of pushes and tending to more 
gauss-like behavior upon growing push magnitude. The volatility of V(y\ x) 
measured by the absolute value of the response shows linear dependence on 
the absolute value of the push, and the skewness of V(y\ x) is shown to in- 
herit a sign of the push. The conditional dynamics approach applied in this 
study is compared to regression models of AR-ARCH class. 
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1 Introduction 



Intensive investigations over recent decades have revealed statistically signif- 
icant deviations from an assumption of independent identically distributed 
(IID) increments that underlies the random walk model of stock price dy- 
namics. A direct statistical evidence showing significant deviations from IID 
using a BDS test [2] was presented in [3J. The rejection of the IID hypothesis 
also follows from the volatility - based test described in There is a number 
of dependence patterns showing themselves in such stylized facts as statisti- 
cally significant autocorrelations at intraday timescales, volatility clustering, 
leverage effect, etc. |H 03 El El , as we ^ as correlations between simultaneous 
increments of different stocks. Each of these effects corresponds to some sort 
of probabilistic dependence between the lagged and/or simultaneous price 
increments and their moments. 

In a series of papers [U El El including the present one we study further 
evidence of the presence of dependence patterns in financial time series. An 
approach we apply is based on the direct analysis of the multivariate prob- 
ability distributions of simultaneous and lagged price increments for both 
single stocks and a basket of stocks. A simplest case we concentrate upon is 
that of the bivariate distribution describing the interdependence of two price 
increments in two coinciding or non-overlapping time intervals. We also con- 
sider a natural and transparent interpretation of the bivariate probability 
distribution provided by its sections corresponding to the fixed value of one 
of the variables, i.e. conditional distributions. 

Despite their fundamental importance, multidimensional probability dis- 
tributions of stock price returns (increments) are, to the best of our knowl- 
edge, not widely used. The bivariate distribution of returns in two consecu- 
tive intervals was analyzed, in the particular case of Levy-type marginals, in 
|10j . where some interesting geometric features of this distribution both for 
the case of independent and dependent returns were described. As discussed 
in [TT], the bivariate distribution in question can be considered as a "fin- 
gerprint" reflecting the nature of the pattern embracing the two consecutive 
returns. Let us also mention the "compass rose" phenomenon ^21 GUI E] 
and the discussion of return predictability in and Another line of 
research is an explicit analytical reconstruction of the bivariate distribution 
in question using copulas, see e.g. [121 EH UZ1 EE]- There is a few stud- 
ies devoted to a direct analysis of the conditional distributions. Recently 
an analysis of volatility dynamics exploiting such conditional distributions 
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was described in ^J]- The first moment of the corresponding conditional 
distribution for daily time intervals was studied in [2*U] . 

At the same time the main focus of the efforts to quantify the conditional 
dynamics of financial instruments was on constructing and studying the re- 
gression models, see e.g. [HI [211 [221 [231 [2E1 [2Z1 I2H1 [2H1 EH] Each 
of these models realizes a particular version of the conditional dynamics, in 
which parameters of the conditional distribution describing the forthcoming 
increment depend on lagged increments and moments. In the simplest ver- 
sion of ARCH model [HJ 1221 this conditional distribution is gaussian with 
a standard deviation depending on the magnitude (s) of one or more lagged 
increments. In G ARCH models (HI 123 the conditional standard deviation de- 
pends on lagged standard deviation as well. As such conditionally gaussian 
approach did not allow to describe an observed degree of fat-tailedness of the 
increments, further development included considering fat-tailed conditional 
distributions [23] and, in modern versions j23 12H1 12H EH] , the fat-tailed and 
skewed conditional distributions in which fat-tailedness and skewness are con- 
ditional as well. A case of nonlinear dependence of the conditional mean of 
forthcoming increment on lagged increments could most naturally be treated 
within a class of threshold autoregressive models |21j . 

An approach based on constructing regression type models is, by defini- 
tion, a parametric one. Assuming some specific form of increment dynamics 
one runs statistical tests to determine optimal values of the parameters in 
question. Our approach |SJ Ej is, on contrary, inherently non-parametric. 
We do not use any assumptions on the particular form of probabilistic links 
between price increments. The analysis of price dynamics is made in terms 
of direct examination of the observed multivariate probability distributions. 

In our study of dependence patterns in stock price dynamics [8. a direct 
examination of the moments of the bivariate distributions linking consecutive 
returns of a stock and simultaneous returns of a pair of stocks was performed. 
It was shown that some empirical features of the bivariate distribution in 
question, e.g. conditional volatility smile, result from non-gaussian nature of 
the distirbution. 

In the preceding paper |T we analyze the asymmetry properties of the 
bivariate probability distribution V(x, y) of two consecutive stock price in- 
crements x (push) and y (response) resulting in remarkable market mill de- 
pendence patterns. In the present paper we continue the analysis of [TJ [S] 
by a more close inspection of the properties of the full bivariate distribu- 
tion V(x,y) of increments and the conditional distribution of the response 
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V(y\x). Therefore, despite the fact that we do not discuss specifically the 
market mill properties in the present paper, this paper still belongs to the 
series of studies [TJ |HJ E] under the umbrella title - " Market mill dependence 
pattern 

The paper is organized as follows. In paragraph 2.1 we describe the 
dataset and the probabilistic methodology used in our analysis. 

A detailed description of the results is given in paragraph 2.2. In 2.2.1 
we study geometrical properties of the full bivariate probability distribution 
V(x,y). First we show that the shape of the equiprobability lines of the 
distribution V(x,y) changes upon growing distance from the origin of xy 
plane. Another property of V(x, y) is its approximate invariance with respect 
to rotations at the multiple of 7r/2 around the origin of xy plane. In 2.2.2 
we switch to studying properties of the conditional probability distribution 
V(y\ x) and find that the volatility of V(y\ x) measured by the absolute value 
of the response shows linear dependence on the absolute value of the push, 
and the skewness of V(y \ x) inherits a sign of the push. 

In the discussion section of the paper we first concentrate on non-gaussian 
properties of both the full bivariate distribution V(x, y) and conditional one 
V(y\ x) (see paragraph 3.1). We discuss that the conditional distribution 
V(y\x) tends to more gauss-like behavior upon growing magnitude of the 
push. 

Our studies of the bivariate distribution are, by default, also studies of 
a special version of the conditional dynamics in which information on the 
value of a price increment is fully accounted for in describing the probabilistic 
pattern characterizing the next increment. It is therefore of direct interest 
to compare our results with the ones obtained within the regression model 
approach. This is done in section 3.2 of the paper. 

The paragraph 4 finalizes the paper by describing main conclusions and 
outlook of the future studies. 

2 Properties of push - return distribution 

An analysis of the properties of the push - return distribution described in 
the present paragraph continues the study initiated in pQ. It goes into further 
details in describing the unique properties of the geometry of the bivariate 
probability distribution under consideration and analyzes moments of the 
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corresponding conditional distribution 2 . 



2.1 Data and methodology 

Our study of high frequency dynamics of stock prices is based on data on the 
prices of 100 stocks traded in NYSE and NASDAQ in 2003-2004 sampled at 
1 minute frequency 3 . 

Let us consider two non-overlapping time intervals of length ATi and AT2, 
where the interval AT 2 immediately follows after ATi. We shall denote the 
price increment in the first interval p(ti+ ATi) — p(^i) (push) by x and that in 
the second one p(t 2 + AT 2 ) — p(i 2 ) (response) by y. In this study we consider 
ATi — AT 2 = 1 min., 3 min., 6 min. The full probabilistic description in 
xy plane is given by the bivariate probability density V(x,y). An analysis 
of the full bivariate distribution V(x, y) is often facilitated by considering 
its cross-sections corresponding to conditional distributions such as, e.g., the 
conditional distribution of response at given push V(y\ x) = V(x, y)/V{x). 

Let us stress that our data set combines pairs of price increments for 
all stocks belonging to the specified group (see Appendix), so that a set of 
events (pairs of increments) unifies all subsets of events characterizing indi- 
vidual stocks. A further detailed study of general features of the probability 
distributions V(x,y) and V(y\ x) constitutes the main topic of the present 
paper and comprehends the analysis of pQ . 

Let us note that analogously to pQ i n our analysis we use price increments. 
The results obtained using returns are in qualitative agreement to the ones 
discussed in the present paper. 

2.2 Structure of the bivariate distribution 

Let us discuss properties of the bivariate distribution V(x,y). 

2 Although an analysis of the properties of the bivariate probability distribution and 
the corresponding conditional distribution presented in the present paper is self-contained, 
we strongly refer to £Q for a more comprehensive picture. 

3 The list of stocks is given in the Appendix 
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2.2.1 Global two-dimensional geometry. Invariance with respect 
to rotations. 



The two-dimensional projection of \og 8 V(x,y) in case of two adjacent 1- 
minute intervals is shown in Fig. 1. To facilitate a discussion of some qualita- 
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Figure 1: Logarithm of two-dimensional distribution \og 8 V(x,y), AT = 1 
min. 

tive features seen in Fig. 1, let us sketch profiles of the equiprobability levels 
of V(x, y) in Fig. 2, where the xy plane is divided into sectors numbered 
counterclockwise from I to VIII. The shape of equiprobability lines shown 
in Figs. 1,2 can be described as a superposition of a basic regular pattern, 
rhomboid in the vicinity of the origin and circular away from it, perturbed 
in such a way that each of the even sectors (II,IV,VI,VIII) contains more 
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Figure 2: Sketch of the equiprobability levels of the bivariate distribution 
V(x, y). The basic regular symmetric structure is shown by brown lines, the 
actual structure - by blue ones. 

probability density than each of the odd ones (sectors I, III, V, VII). The ge- 
ometry of unperturbed basic regular pattern (shown in brown in Fig. 2) can 
be described as 

\ X \ a + \y\<* = const, (1) 

where a ~ 1 in the vicinity of the origin and a ~ 2 far away from it. 

An interesting property of the bivariate distribution V(x,y) is its ap- 
proximate invariance with respect to rotations at the multiples of ir/2. The 
distribution geometry is quite nontrivial: as already mentioned, all even sec- 
tors contain more probability density than the odd ones 4 . In terms of sample 
paths (pairs of increments) composed by increments ±d and ±£ 2 the exact 
symmetry with respect to rotations at multiples of n/2 leads to the following 
chain of equalities: 

P(Ci i C 2 )=P(-C 2 , d) = V{-Ci, -C 2 ) = V{( 2 , -Ci) • (2) 

4 The nontrivial asymmetric properties of the distribution V(x, y) leading to the market 
mill structure and z-shaped structure of the conditional mean response was analyzed in 
full details in pQ. 
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An approximate character of the symmetry with respect to rotations at mul- 
tiples of 7r/2 shows itself in varying degree of proximity of the corresponding 
distributions. To give a quantitative estimate of this proximity we consider 
three bivariate probability distributions V w ' 2 (x,y), V 7T (x,y) and V 3n ^ 2 (x,y) 
obtained by rotating the original distribution V(x, y) at an angle fa — i -ir/2, 
where % = 1,2, 3. To compute a distance between two matrices corresponding 
to distributions V^ i {x,y) and V^ j (x,y) obtained by rotations of V(x,y) at 
fa and <pj respectively we flatten each of them into a vector, normalize it 
and compute a distance Dx(fa,fa) = dist t x (V^, ) between these vectors 
using the L\ ("Manhattan") metric 5 . We find 

£>i(0,7r/2) = £1(0,371-/2) = Di(7r/2,3vr/2) = D^ir, 3tt/2) (3) 
£>i(0,tt) = D 1 (n/2,3n/2) (4) 

and 

gi(M) ^i(vr/2,37r/2) 

A(0,7r/2 L>i(7r/2,7r) ' 1 ] 

Therefore we see that the rotation at 7r is a "better" symmetry of the full dis- 
tribution than the rotation at tt/2 implying, in turn, that equality V((\, (2) — 
V(—(i, — C2) holds to a better accuracy than V((i, C2) = ^(—(2, (2)- 



2.2.2 Geometry of response profile 

A detailed view on the distribution V(x, y) is provided by examining the 
corresponding conditional distributions such as, e.g., V(y\x) = V(x,y)/V(x) 
describing the probabilistic shape of response y at given push x. In Fig. 3 
we plot three cross-sections of the surface \ogV(x,y) corresponding to x — 
$ 0.01,0.07 and 0.25. We observe a clear change in the structure of the 
response with growing push. Qualitatively this change can be described 
by evolution of the parameter a in the stretched exponential distribution 
T > a(x)(y) — Af(at(x)) exp [— (\y\/a) a ( xS) /a(x)~\ from a(x) ~ 1 at small |x| to 
a(x) ~ 2 at large \x\, so that the distributions looks evolving from the 
squeezed tent-like at small pushes to almost gaussian at large ones. Note 
that this interpretation is consistent with the suggested description of the 
geometry of equiprobablity lines in Eq. (JTJ). 

5 In this estimate we restrict our consideration to the domain {\x\ < $ 0.3, \y\ < $ 0.3} 
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Response profiles 
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Figure 3: Response profiles of V(x,y) for x = $0.01, 0.07 and 0.25 
2.2.3 Moments of conditional distribution 

Let us first consider a useful quantitative characteristics of a shape of the 
distribution V(y \ x), the conditional mean absolute response 



The dependence of (\y\) x on the push x is plotted in Fig. 4. We see that 
to a good accuracy the mean absolute response is linear in the absolute 
value of the push, (|y|)a;OCCo + ci|a;|. Let us recall that the mean response 
(y) x is a nonlinear function of the push x \V\. As the absolute response is 
a robust measure of volatility, in Fig. 4 we have an example of conditional 
volatility smile or dependence volatility smile that was studied, in terms of 
a standard deviation of normalized returns, in [8 . The dependence of (\y\) x 
on x describes how much of response volatility is created for a given push. 
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Conditional mean absolute response 
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Figure 4: The conditional mean of absolute increment versus the initial push 



Because of the sensitivity of the mean conditional absolute response to 
the higher moments of the conditional distribution one can, by comparing 
it to the value obtained for the Gaussian distribution with the same stan- 
dard deviation, gauge the deviation of the distribution in question from the 
Gaussian Let us thus consider the following ratio: 



Pa 



<M>r 



(7) 



where (\y\)x = a/2/7T(t x and a x is an observed standard deviation of the 
corresponding conditional response. The ratio p x is plotted, in three cases 



10 



of consecutive 1 - minute, 3 - minute and 6 - minute intervals in Fig. 5 The 




Figure 5: Relative distance from the Gaussian distribution p x 

pattern seen in Fig. 5 unambiguously shows the progressive "gaussization" of 
the response profile with growing push. This is a highly nontrivial property 
of the distribution V(x,y). The same question can be addressed by com- 
puting the anomalous kurtosis of the conditional distribution. The results 
obtained for anomalous kurtosis support the conclusion on "gaussization" 
but are rather noisy, especially for the case of 1 minute intervals. 

The mean absolute response by default characterizes the symmetric com- 
ponent of the conditional probability distribution V(y\ x) with respect to the 
axis y = 0. Asymmetry of V(y\ x) with respect to this axis is characterized 
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by its odd moments. The first moment, the mean conditional response, was 
studied in detail in PQ. It was shown that the mean conditional response has 
a nontrivial zizgag-shaped dependence on the push. To probe the asymmet- 
ric contributions of higher order, let us consider the skewness V(x) of the 
conditional response 6 



where o v x is the conditional standard deviation of response at given push. 
In Fig. 6 we plot the skewness of the response for the same set of consecu- 
tive time intervals. The pattern seen in Fig. 6 corresponds to an interesting 
phenomenon. The asymmetry of the distribution of conditional response 
characterized by skewness has the sign of the initial push, so that for nega- 
tive pushes the response distribution is always negatively skewed, etc. Let 
us note that the same conclusion can be reached by considering a robust 
characteristics of distribution asymmetry, a difference between the median 
of the distribution and its mean. 

The generic symmetry properties of the distribution V(x, y) are best re- 
vealed by considering its symmetry with respect to the axes y = 0, y = 
x, x = and y = —x (patterns I - IV) [Tj. The patterns are of two types: 
pattern I is equivalent to pattern III and pattern II is equivalent to pattern 
IV. Pattern I was analyzed above, so let us consider pattern II. To analyze 
the symmetry properties of the distribution V(x,y) with respect to the axis 
y = x it is convenient to introduce the new variables 



so that in this case one deals with the conditional distribution V(z\z). 

The dependence of the shape of V{z\z) on the "push" z can again be 
explored by considering the ratio 



The ratio p z is plotted in Fig. 7. The generic pattern is the same as in Fig. 5: 
the distribution V(z\z) is progressively more and more gaussian with growing 
\z\. The shape is somewhat different though, so that the gaussization process 
is different in this case. 

6 Here we assume that the third moment of the conditional distribution exists. 





z = x + y and z = y — x 



(9) 



Pz 




(10) 
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Figure 6: The conditional skewness versus the initial push 

3 Discussion 

In the previous section we discussed a number of properties characterizing 
the probabilistic dependence patterns relating stock price increments in con- 
secutive time intervals. Our approach is based on the direct analysis of the 
bivariate probability distribution dependent on the two price increments in 
question V(x,y) and of the corresponding conditional distribution V(y\x). 
We have concentrated on high frequency data with increments in time inter- 
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vals of length AT = 1 min., 3 min. and 6 min 7 . 

3.1 Gaussization of conditional distribution at large 
magnitudes of price increments 

Let us first discuss in some more details a remarkable property of gaus- 
sization of multivariate distributions of price increments far away from the 
origin in the xy plane. A first hint comes from the analysis of the geometry 
of equiprobability levels of the push - response bivariate probability distri- 
bution V(x,y). As seen in Fig. 1 and sketched in Fig. 2, the geometry of 
equibrobability lines changes from rhomboid in the vicinity of the origin to 
circular far away from it, which is consistent with distribution changing the 
shape from bivariate Laplace to bivariate gaussian one. This is, however, not 
a proof of gaussization: a Laplace distribution with standard deviation of 
V(y\x) growing with \x\ leads to the same visual pattern. 

Quantitative proof of gaussization comes from considering the ratio p x 
defined in eq. (J7J) characterizing the shape of the conditional response distri- 
bution. In this ratio the effect of variable conditional standard deviation is 
factored out. We have checked the gaussization of the bivariate distribution 
V(x,y) along the two axes, y = and y = x (see Figs. 5 and 7). The speed 
of gaussization turns out different, but the phenomenon itself is undoubtedly 
present in both cases. 

3.2 Conditional dynamics: direct analysis of multivari- 
ate distribution vs regression models 

Knowledge of full bivariate distribution fully specifies corresponding condi- 
tional distributions and, therefore, a particular variant of conditional dynam- 
ics. As already mentioned in the introduction, the main body of research on 
conditional dynamics in financial markets was done within a paradigm of 
regression models [3I2I1I221I23I21I23I23I2I1I23I23I1I], in which con- 
ditional distribution for the forthcoming price increment depends on lagged 
increments and lagged conditional moments. In the simplest version of the 
AR(1)-ARCH(1) model [3122] the conditional distribution of the return r y 

7 The results of pQ show that it is reasonable to expect that all the features discussed 
in the present paper will hold at larger intraday time scales as well. 
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is gaussian, Law(r y ) = Vc(r y \ fi y , a y ), where conditional mean \i y and con- 
ditional standard deviation a y depend on return r x in the previous interval, 
fi x oc r x and a y = a + fir^. This setting is methodologically equivalent to 
the one considered in the present paper in the sense that the only informa- 
tion required for computing the conditional distribution for r y is the value 
of r x , so that the conditional dynamics of AR(1)-ARCH(1) is fully described 
by conditional distribution V(r y \r x ). To match the features described in 
the preceding |T] and present papers one would have to consider a nonlinear 
dependence of \i y on r x and construct a fairly complicated fat-tailed skewed 
conditional probability distribution. Even forgetting for a moment about the 
z-shaped dependence of mean conditional response on the push, meaningful 
comparison could be with a version of ARCH(l) with a fat-tailed skewed 
conditional distribution. As to the zigzag-shaped dependence of the mean 
conditional response on the push, the most natural treatment could be given 
within a class of threshold autoregressive (TAR) models fH\ . 

To compare our results with the properties of autoregressive models con- 
sidered in the literature let us consider the AR(1)-GARCH(1,1) model 13123], 
particularly its versions with fat-tailed [2H| and conditionally fat-tailed and 
skewed conditional distributions for residuals [23 [211 123 123 EDI- In these 
models conditional volatility is a function of both lagged returns and volatil- 
ity, so a comparison with the results obtained using the bivariate distribution 
V(x, y) is not direct. With this remark in mind, let us make some comparison 
at the "moment by moment" basis. 

• The conditional mean in the AR(1) model is by default a linear function 
of the push, fi x oc r x . This is equivalent to an assumption of the 
ellipticity of the underlying bivariate distribution V(r x ,r y ). As shown 
in PQ, the intraday conditional dynamics is characterized by a fairy 
complex nonlinear z-shaped pattern of mean conditional response. To 
take this into account one should generalize AR(1) to TAR(l). 

• The conditional standard deviation a y in GARCH(1,1) models is a 
growing function of r x , a y = const. + r\ + e a . This is consistent with 
the dependence smile discussed in jH] and shown in Fig. 3. 

• The results obtained in the framework of generalized GARCH models 
for the conditional skewness [23 1211 123 1231110] (only the daily timescale 
was considered) are somewhat contradictory. In ^3 123 123 EH] a con- 
clusion was that negative return is followed by negative conditional 
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skew - in agreement with the results of the previous section, while no 
conclusion on the sign of conditional skew following the positive return 
was reached. At the same time, the conclusion of [SUj was that the sign 
of conditional skew is always opposite to the sign of initial return. 

• The main result on conditional kurtosis obtained within the generalized 
GARCH approach was [28J that it is time dependent and not always 
existent. A comparison with our result on the progressive gaussization 
of the response distribution with growing magnitude of the push does 
not seem possible. 

Our method of directly analyzing the conditional distribution allows to 
describe its properties in a model-independent framework. If analyzed from 
the point of view of regressive conditional dynamics the results described in 
the present paper and in the preceding paper [T| can be formulated as follows. 
The conditional dynamics is 

• nonlinear 

• heteroskedastic as seen in the volatility dependence smile 

• not conditionally gaussian 

• characterized by conditional skew depending on the lagged increment 

• characterized by conditional fat-tailedness that diminishes with grow- 
ing magnitude of lagged increment 



4 Conclusions and outlook 

Let us formulate once again the main conclusions of the paper. Studying 
the geometry of the full bivariate probability distribution V(x, y) and the 
corresponding conditional distribution V(y\ x) we have found that 

• The shape of equiprobability lines of the bivariate probability distri- 
bution V(x,y) changes from roughly rhomboid in the vicinity of the 
origin to roughly circular far away from it. 

• The conditional distribution V(y\x) is shown to become progressively 
more gaussian at increasing push magnitudes. Analogous gaussization 
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takes place for conditional distribution considered with respect to the 
axis y = x. 

• The bivariate distribution V(x, y) is approximately invariant with re- 
spect to rotations at multiples of tt/2 

• The conditional mean absolute response is linear in the absolute value 
of push 

• The skewness of the response distribution inherits a sign of the push 

As was emphasized above in jT] and the present paper we study a com- 
bined ensemble of all pairs of consecutive price increments from all stocks. 
How is this overall geometry related to the geometric properties of individual 
stocks? Only after answering this question can one come close to describing 
the microscopic mechanism underlying the uncovered probabilistic depen- 
dence patterns. This issue is analyzed in the companion paper jU]. 

The work of A.L. was partially supported by the Scientific school support 
grant 1936.2003.02 

5 Appendix 

Below we list stocks studied in the paper: 

A, AA, ABS, ABT, ADI, ADM, AIG, ALTR, AMGN, AMD, AOC, APA, 
APOL, AV, AVP, AXP, BA, BBBY, BBY, BHI, BUB, BJS, BK, BLS, BR, 
BSX, CA, CAH, CAT, CC, CCL, CCU, CIT, CL, COP, CTXS, CVS, CZN, 
DG, DE, EDS, EK, EOP, EXC, FCX, FD, FDX, FE, FISV, FITB, FRE, 
GENZ, GIS, HDI, HIG, HMA, HOT, HUM, JBL, JWN, INTU, KG, KMB, 
KMG, LH, LPX, LXK, MAT, MAS, MEL, MHS, MMM, MO, MVT, MX, 
MYG, NI, NKE, NTRS, PBG, PCAR, PFG, PGN, PNC, PX, RHI, ROK, 
SOV, SPG, STI, SUN, T, TE, TMO, TRB, TSG, UNP, UST, WHR, WY 
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Figure 7: Relative distance from the Gaussian distribution in rotated coor- 
dinates p z 



21 



